Performance Analysis of Gang Scheduling in a Distributed System under Processor Failures
نویسنده
چکیده
In this paper we study the performance of a distributed system that is subject to hardware failures and subsequent repairs. A special type of scheduling called gang scheduling is considered, under which jobs consist of a number of interacting tasks which are scheduled to run simultaneously on distinct processors. Two different gang scheduling policies used to schedule parallel jobs are examined in two cases: In the blocking case, a job that is blocked due to processor failure keeps all of its assigned processors until the failed processor is repaired. In the nonblocking case, the remaining operable processors can serve other jobs. Furthermore, this paper combines two different I/O scheduling methods with the two gang scheduling policies, so that three policy combinations are used. The impact of the variability in processor service time is also studied. Various degrees of multiprogramming, coefficients of variation of processor service time and failure to repair ratios are examined using simulation techniques.
منابع مشابه
Gang Scheduling in a Distributed System under Processor Failures and Time-Varying Gang Size
In this paper we study the performance of a distributed system which is subject to hardware failures and subsequent repairs. A special type of scheduling called gang scheduling is considered, under which jobs consist of a number of interacting tasks which are scheduled to run simultaneously on distinct processors. System performance is examined and compared in cases where different distribution...
متن کاملPerformance Analysis of a Distributed System under Time-Varying Workload and Processor Failures
This paper studies the performance of a distributed system which is subject to hardware failures and subsequent repairs. A special type of scheduling called gang scheduling is considered, under which jobs consist of a number of interacting tasks which are scheduled to run simultaneously on distinct processors. The distribution for the number of parallel tasks per job varies with time. Two gang ...
متن کاملParallel Virtual Memory for Time Shared Environments
This paper analyses the issues involved in providing virtual distributed shared memory for time-shared parallel machines. We study the performance of two di erent page management policies, namely, static and dynamic page allocation under two widely accepted scheduling policies: Gang scheduling and independent processor scheduling. The performance of each page management policy is studied under ...
متن کاملStochastic Analysis of Gang Scheduling in Parallel and Distributed Systems
Gang scheduling is an approach for resource allocation in parallel and distributed systems that combines time-sharing with space-sharing to ensure a short response time for interactive tasks and high overall system throughput. In this paper, we present queueing theoretic models for a particular gang scheduling system under a workload representative of large-scale engineering and scientiic compu...
متن کاملPerformance Evaluation of Gang Scheduling for Parallel and Distributed Multiprogramming
In this paper we explore the performance of various aspects of gang scheduling designs. We developed an event-driven simulator of a vanilla gang scheduler that relies on the Distributed Hierarchical Control (DHC) structure. We also developed three variations of the vanilla gang scheduler that rely on a push-down heuristic and on two job-migration schemes to decrease response times by reducing p...
متن کامل